Dimensional reduction for phylogenetic tree models
نویسنده
چکیده
We present a method of dimensional reduction for the general Markov model of sequence evolution on a phylogenetic tree. We show that taking certain linear combinations of the associated random variables (site pattern counts) reduces the dimensionality of the model from exponential in the number of extant taxa, to quadratic in the number of taxa, while retaining the ability to statistically identify phylogenetic divergence events. A key feature is the identification of an invariant subspace which depends only bilinearly on the model parameters, in contrast to the usual multi-linear dependence in the full space. We discuss potential applications including the computation of split (edge) weights on phylogenetic trees from observed sequence data.
منابع مشابه
Quantitative Comparison of Tree Pairs Resulted from Gene and Protein Phylogenetic Trees for Sulfite Reductase Flavoprotein Alpha-Component and 5S rRNA and Taxonomic Trees in Selected Bacterial Species
Introduction: FAD is the cofactor of FAD-FR protein family. Sulfite reductase flavoprotein alpha-component is one of the main enzymes of this family. Based on applications of this enzyme in biotechnology and industry, it was chosen as the subject of evolutionary studies in 19 specific species. Method: Gene and protein sequences of sulfite reductase flavoprotein alpha-component, 5S rRNA sequence...
متن کاملQuantitative Comparison of Tree Pairs Resulted from Gene and Protein Phylogenetic Trees for Sulfite Reductase Flavoprotein Alpha-Component and 5S rRNA and Taxonomic Trees in Selected Bacterial Species
Introduction: FAD is the cofactor of FAD-FR protein family. Sulfite reductase flavoprotein alpha-component is one of the main enzymes of this family. Based on applications of this enzyme in biotechnology and industry, it was chosen as the subject of evolutionary studies in 19 specific species. Method: Gene and protein sequences of sulfite reductase flavoprotein alpha-component, 5S rRNA sequence...
متن کاملDirect Molecular Detection and Phylogenetic Tree Analysis of Gastrointestinal Protozoan Parasites (Giardia lamblia, Entamoeba histolytica, Cryptosporidium parvum) from Diarrhea Infection in Kut City of Iraq: A Short Communication
Background: The intestinal tract of human can be infected by protozoan parasites. In this short communication, the stool samples were collected from patients with diarrhea referred to Kut hospital, Iraq, and then the parasites (Giardia lamblia, Entamoeba histolytica, Cryptosporidium parvum) were considered for molecular identification. Methods: Stool samples were collected from 69 patients wit...
متن کاملOn Determining if Tree-based Networks Contain Fixed Trees
We address an open question of Francis and Steel about phylogenetic networks and trees. They give a polynomial time algorithm to decide if a phylogenetic network, N, is tree-based and pose the problem: given a fixed tree T and network N, is N based on T? We show that it is [Formula: see text]-hard to decide, by reduction from 3-Dimensional Matching (3DM) and further that the problem is fixed-pa...
متن کاملAn Auto-Validating, Trans-Dimensional, Universal Rejection Sampler for Locally Lipschitz Arithmetical Expressions
We introduce a trans-dimensional extension of the rejection sampler of von Neumann. Our interval analytic construction of the rejection sampler provides a universal method that is capable of producing exact samples from a large class of trans-dimensional target densities with locally Lipschitz arithmetical expressions. We illustrate the efficiency of the sampler by theory and by examples in up ...
متن کامل